A. 


hAR 
hPR 
hMR 
hGR 
hER 


A Complement 

5' -ACC 
** 

TGT 
*** 

GAG 
* 

GGC 
* 

TGT 
** 

AAG 
*** 

GTC 

TTC 
*** 

TTC 
★** 

AAA 

AG- 3* 
** 

(100%) 

(X) 

(11) 

(4) 

(5) 

(6) 

(3, 17) 
(17) 

ACA 

TGT 

GGA 

AGC 

TGC 

AAG 

GTC 

TTC 

TTC 

AAA 

AG 

(84%) 

ACC 

TGT 

GGG 

AGC 

TGT 

AAG 

GTC 

TTC 

TTT 

AAG 

AG 

(88%) 

ACC 

TGT 

GGC 

AGC 

TGC 

AAA 

GTT 

TTC 

TTC 

AAA 

AG 

(81%) 

ACT 

TGT 

GGA 

AGC 

TGT 

AAA 

GTT 

TTC 

TTC 

AAA 

AG 

(81%) 

TCC 

TGT 

GAG 

GGC 

TGT 

AAG 

GCC 

TTC 

TTC 

AAG 

AG 

(91%) 

ACG 

TGT 

GAA 

GGC 

TGC 

AAG 

GGT 

TTC 

TTT 

AGA 

AG 

(78%) 

GCC 

TGT 

GAG 

GGC 

TGC 

AAG 

GGC 

TTC 

TTC 

CGC 

CG 

(78%) 


■ARHFLIH -X 


E 
x 

ARHFLI 5'| S^S^l 1 1 1 1 ^3* 

t t 500 1000 TGA 


c. 


DNA- Binding Domain 


hAR 

hPR 

hMR 

hGR 

hER 

cVDR 

hT3R 


(aa 567) 

(aa 603) 

(aa 421) 

(aa 185) 


VERBA (aa 
hRAR (aa 


(aa 102) 
37) 
58) 


+ 



+ 






10 







+ 


20+ 







30 


c 

L 

I 

c 

G 

D 

E 

A 

S 

G 

c 

H 

Y 

G 

A 

L 

T 

C 

G 

S 

C 

K 

V 

F 

F K 

R 

A 

AEG 

(100%) 

c 

L 

1 

c 

G 

D 

E 

A 

s 

G 

C 

H 

Y 

G 

V 

L 

T 

c 

G 

S 

C 

K 

V 

F 

F K 

R 

A 

MEG 

(94%) 

c 

L 

V 

c 

G 

D 

E 

A 

s 

G 

C 

H 

Y 

G 

V 

V 

T 

c 

G 

S 

C 

K 

V 

F 

F K 

R 

A 

V E G 

(87%) 

c 

L 

V 

c 

S 

D 

E 

A 

s 

G 

C 

H 

Y 

G 

V 

L 

T 

c 

G 

S 

c 

K 

V 

F 

F K 

R 

A 

V E G 

(87%) 

c 

A 

V 

c 

N 

D 

Y 

A 

s 

G 

Y 

H 

Y 

G 

V 

W 

S 

c 

E 

G 

c 

K 

A 

F 

F K 

R 

S 

I Q G 

(55%) 

c 

G 

V 

c 

G 

D 

R 

A 

T 

G 

F 

H 

F 

N 

A 

M 

T 

c 

E 

G 

c 

K 

G 

F 

F R 

R 

S 

M K R 

(48%) 

c 

V 

V 

c 

G 

D 

K 

A 

T 

G 

Y 

H 

Y 

R 

C 

I 

T 

c 

E 

G 

c 

K 

G 

F 

F R 

R 

T 

I Q K 

(48%) 

c 

V 

V 

c 

G 

D 

K 

A 

T 

G 

Y 

H 

Y 

R 

C 

I 

T 

c 

E 

G 

c 

K 

S 

F 

F R 

R 

T 

I Q K 

(48%) 

c 

F 

V 

c 

Q 

D 

K 

S 

S 

G 

Y 

H 

Y 

G 

V 

S 

A 

c 

E 

G 

c 

K 

G 

F 

F R 

R 

S 

I Q K 

(45%) 


hAR 

K 

Q 

K 

hPR 

Q 

H 

N 

hMR 

Q 

H 

N 

hGR 

Q 

H 

N 

hER 

H 

N 

D 

cVDR 

K 

A 

M 

hT3R 

N L H 

P 

S 

VERBA 

N L H 

P 

T 

hRAR 

N 

M 

V 


+ 40 + 

YLCASRNDCTIDK 
YLCAGRNDCIVDK 
YLCAGRNDCI 1DK 
YLCAGRNDCIIDK 
YMCPATNQCTIDK 
FTCPFNGDCKITK 
YSCKYEGKCVIDK 
TSCTYDGCCVIDK 
YTCHRDKNCI INK 


50 + + 
FRRKNCPS CRL 
IRRKNCPACRL 
IRRKNCPACRL 
IRRKNCPACRY 
NRRKSCQACRL 
DNRRHCQACRL 
VTRNQCQECRF 
ITRNQCQLCRF 
VTRNRCQYCRL 


60+ 




R K C 

Y E A G 

M 

(100%) 

R K C 

C Q A G 

M 

(71%) 

QKC 

L Q A G M 

(71%) 

R K C 

L Q A G 

M 

(71%) 

R K C 

Y E V G 

M 

(63%) 

K R C 

V D I G 

M 

(40%) 

K K C 

I Y V G 

M 

(40%) 

K K C 

I S V G 

M 

(37%) 

QKC 

F E V G 

M 

(43%) 




10 BO 30 40 50 60 

GAGCTCTGGA CAAAATTGAG CGCCTATGTG TACATGGCAA GTGTTTTTAG TGTTTGTGTG 
CTCGAGACCT GTTTTAACTC GCGGATACAC ATGTACCGTT CACAAAAATC ACAAACACAC 

70 80 90 100 HO ISO 

TTTACCTGCT TGTCTGGGTG ATTTTGCCTT TGAGAGTCTG GATGAGAAAT GCATGGTTAA 
AAATGGACGA ACABACCCAC TAAAACGGAA ACTCTCAGAC CTACTCTTTA CGTACCAATT 

ISO 140 ISO t 160 - 170 180 

ABGCAATTCC AGACAGGAAG AAAGGCAGAG AAGAGGGTAG AAATGACCTC TGATTCTTGG 
TCCGTTAAGG TCTGTCCTTC TTTCCGTCTC TTCTCCCATC TTTACTGGAG ACTAAGAACC 

190 EOO E10 E20 B30 E40 

GGCTGAGGGT TCCTAGAGCA AATGGCACAA T6CCACGAGG CCCGATCTAT CCCTATGACG 
CCGACTCCCA AGGATCTCGT TTACCGTGTT ACGGTGCTCC GGGCTAGATA GGGATACT6C 

250 260 E70 2BO H9C> 300 

GAACTCTAAG GTTTCAGCAT CAGCTATCTG CTGGCTTGGT CACTGGCTTG CCTCCTCAGT 
CTTGAGATTC CAAAGTCGTA GTCGATAGAC GACCGAACCA GTGACCGAAC GGAGGAGTCA 

3 1 o 3EO 330 340 350 360 

TTGTAGGAGA CTCTCCCACT CTCCCATCTG CGCGCTCTTA TCAGTCCT6A AAAGAACCCN 
AACATCCTCT GAGAGGGTGA GAGGGTAGAC GCGCGAGAAT AGTCAGGACT TTTCTTGGGN 

qyo 380 390 400 41 0 4EO 

TGGCNAGCCA GGAGCNAGGT ATTCNTATCG TCCTTTTCNT CCTCCTNGCC TCACCTNGTT 
ACCGNTCGGT CCTCGNTCCA TAAGNATAGC AGGAAAAGNA GGAGGANCGG AGTGGANCAA 

43 n ^O 450 460 470 480 

GNTTTTTAGA TTGGNCTTNG NAACCAAATT TGTATGCTGG CCTCCAGGAA ATCTG6AGCC 
-CNAAAAATCT AACCNGAANC NTTGGTTTAA ACATACGACC GGAGGTCCTT TAGACCTCGG 

A90 500 51 0 5EO 53 O 540 

TGGCGCCTAA ACCTTGGTTT AGGAAAGCAG GAGCTATTCA GGAAGCAGGG TCCTCCAGGG 
ACCGCGGATT TGGAACCAAA TCCTTTCGTC CTCGATAA6T CCTTCGTCCC AGGAGGTCCC 


550 


M „ 560 570 5BO 590 600 

CTAGARCTA6 CCTCTCCTGC CCTCGCCCAC GTGCGCCAGC ACTTGTTTCT CCAAAGCNAC 
GATCTCGATC GGAGA6GACG GGAGCGGGTG CACGCGGTCG TGAACAAAGA GGTTTCGNTG 


610 


620 630 640 650 660 

TAGGCAGGCG TTAGCGCGCG GTGAGGGGAG GGGAGAAAAG GAAAGGGGAG GGGAGGGAAA 
ATCCGTCCGC AATCGCGCGC CACTCCCCTC CCCTCTTTTC CTTTCCCCTC CCCTCCCTTT 


670 680 690 700 710 7EO 

AGGAGGTGGG AAGGCAAGGA GGCCGGCCNG GTGGGGGCGG GACCCGACTC GCANNAACTG 
TCCTCCACCC TTCCGTTCCT CCGGCCG6NC CACCCCCGCC CTGGGCTGAG CGTNNTTGAC 

730 740 750 760 770 780 

TTGCATTTGC TCTCCACCTC CCAGCGCCCC CTCCGAGATC CCGGGGAGCC AGCTTGCTGG 
AACGTAAACG AGA6GTGGAG GGTCGCGGGG GAGGCTCTAG GGCCCCTCGG TCGAACGACC 

790 800 810 8EO 830 840 

GAGAGCGGGA ACGGTCCGGA GCAAGCCCAG AGGCAGAGGA GGCGACAGAG GGAAAAAGGG 
CTCTCGCCCT TGCCAGGCCT CGTTCGGGTC TCCGTCTCCT CCGCTGTCTC CCTTTTTCCC 

850 860 B70 880 890 *?00 

CCCNAGCTAG CCGCTCCAGT GCTGTACAGN AGCCGAAGGA CGCACCACGC CAGCCCCAGC 
GGGNTCGATC GGCGAGGTCA CGACATGTCN TCGGCTTCCT GCGTGGTGCG GTCGGGGTCG 


91 0 980 930 

CCGGCTCCAG CGACAGCNAA CGCCTCTTGC 
GGCCGAGGTC GCTGTCGNTT GCG6AGAACG 

970 9BO 99C 

CTTTCCTCTT CGGTGAAGTT TTTAAAAGCT 
GAAAGGAGAA GCCACTTCAA AAATTTTCGA 

1030 1040 1050 

CCTGGTAGGA CTGACGGCTG CCTTTGTCCT 
GGACCATCCT GACTGCCGAC GGAAACAGGA 

1090 1100 lllO 

GCCTTCCCCC CCTCCCCCGT CTTCTCTCCC 
CGGAAGGGGG GGASGGGGCA GAAGAGAGGG 

1150 1160 1170 

ACCCCCCTCA CCACCCTTCT CCCCACCCGC 
TGGGGGGAGT GGTGGGAAGA GGGGTGGGCG 

ieio ieeo 1230 

GNCCGAGTTT GCAGAGAGGT AACTCCCTTT 
CNGGCTCAAA CGTCTCTCCA TTGAGGGAAA 

1270 1£80 1890 

TGCAAAGAAG GCTCTTAGGA GCAGGCGACT 
ACGTTTCTTC CGAGAATCCT CGTCCGCTGA 

1330 1340 1350 

CNGCCTGGTT AGGCTGCACG CGGAGAGAAC 
GNCGGACCAA TCCGACGTGC GCCTCTCTTG 

1390 1400 1410 

CCTCCTGCCT TCCCCACCCC GAGTGCGGAG 
GGAGGACGGA AGGGGTGG6G CTCACGCCTC 

1450 1460 147D 

GTCTTCAGTA GCCAAAAAAC AAAACAAACA 
CAGAAGTCAT CGBTTTTTTG TTTTGTTTGT 

1510 1580 1530 

ATAATAACTC AGTTCTTATT TGCACCTACT 
TATTATTGAG TCAAGAATAA ACGTGGATGA 

1570 1580 1590 

ATTTTGTTTT TTTCTTTTAA GATCTGGGCA 
TAAAACAAAA AAAGAAAATT CTAGACCCGT 

1630 1640 1650 

GACAGACTGT GAGCCTAGCA GGGCAGATCT 
CTGTCTGACA CTCGGATCGT CCCGTCTAGA 

1690 1700 1710 

CTTTGAGGCT GTCAGAGCGC TTTTTGCGTG 
GAAACTCCGA CAGTCTCGCG AAAAACGCAC 


940 950 960 

ANGCGTTCGA AGCCGCCGCC CGGAGCTGCC 
TNCGCAAGCT TCGGCGGCG6 GCCTCGAC6G 

1000 1010 lOSO 

GCTAAAGACT CGGAGGAAGC AAGGAAAGTG 
CGATTTCTGA GCCTCCTTCG TTCCTTTCAC 

1060 1070 lOBO 

CCTCCTCTCC ACCCCGCCTC CCCCCACCCT 
GGAGGAGAGG TGGGGCGGAG GGGGGTGGGA 

nao 1130 1140 

GCAGCTGCCT CAGTCGGCTA CTCTCAGCCA 
CGTCGACGGA GTCAGCCGAT GAGAGTCGGT 

HBO 1190 1SOO 

CCCCCCGCCC CCGTCGGCCC AGCGNTGNCA 
GGGGGGCGGG GGCAGCCGGG TCGCNACNGT 

1240 1B50 I860 

GGCTGCGAGC GGGCGAGNCT AGCTGCACAT 
CC6ACGCTCG CCCGCTCNGA TCGACGTGTA 

1300 1310 1320 

GGGGAGCGGC TTCAGCACTG CAGCCACGAC 
CCCCTCGCCG AAGTCGTGAC GTCGGTGCTG 

1360 1370 1380 

CCTCTGTTTT CCCCCACTCT CTCTCCACCT 
GGAGACAAAA GGGGGTGAGA GAGAGGTGGA 

1420 1430 1440 

CCAGAGATCA AAAGATGAAA AGGCAGTCAG 
GGTCTCTAGT TTTCTACTTT TCCGTCAGTC 

1480 1490 1500 

AAAACAAAAA AGCCGAAATA AAAGAAAAAG 
TTTTGTTTTT TCGGCTTTAT TTTCTTTTTC 

1540 1550 1560 

TCAGTGGACA CTGAATTTGG AAGGTGGAGG 
AGTCACCTGT GACTTAAACC TTCCACCTCC 

1600 1610 1680 

TCTTTTGAAT CTACCCTTCA AGTATTAAGA 
AGAAAACTTA GATGGGAAGT TCATAATTCT 

I66O 1670 16BO 

TGTCCACCGT GTGTCTTCTT CTGCACGAGA 
ACAGGTGGCA CACAGAAGAA GACGTGCTCT 

1720 1730 1740 

GTTGCTCCCG CAAGTTTCCT TCTCTGGAGC 
CAACGAGGGC GTTCAAAGGA AGAGACCTCG 


1750 1760 1770 1780 1790 1800 

TTCCCGCAGG TGGGCAGCTA GCTGCAGCGA CTACCGCATC ATCACAGCCT GTTGAACTCT 
AAGGGCGTCC ACCCGTCGAT CGACGTCGCT GATGGCGTAG TAGTGTCGGA CAACTTGAGA 


1 B 1 O 1 BEO 1 830 1 B40 1 850 1 86C> 

TCTGAGCAAG AGAAGGGGAG GCGGGGTAAG GGAAGTAGGT GGAAGATTCA GCCAAGCTCA 
AGACTCGTTC TCTTCCCCTC CGCCCCATTC CCTTCATCCA CCTTCTAAGT CG6TTCGAGT 

1870 18BO 1B90 1900 1910 19EO 

AGGATGGAAG TGCAGTTAGG GCTGGGAAGG GTCTACCCTC GGCCGCCGTC CAAGACCTAC 
TCCTACCTTC AC6TCAATCC CGACCCTTCC CAGATGGGAG CC6GCGGCAG GTTCTGGATG 

1930 1940 1950 I960 1970 1980 

CGAGGAGCTT TCCAGAATCT GTTCCAGAGC GTGCGCGAAG T6ATCCAGAA CCCGGGCCCC 
GCTCCTCGAA AGGTCTTAGA CAAGGTCTCG CACGCGCTTC ACTAGGTCTT GGGCCCGGGG 

1990 £000 EOIO EOEO £030 £040 

AGGCACCCAG AGGCCGCGAG CGCaCCACCT CCCGGCGCCA GTTTGCTGCT GCTGCAGCAG 
TCCGTGGGTC TCCGGCGCTC GCGTOTGGA GGGCCGCGGT CAAACGACGA CGACGTCGTC 

£050 E060 2070 E080 E090 El OO 

CAGCAGCAGC AGCAGCAGCA GCAGCAGCAG CAGCAGCAGC AGCAGCAGCA GCAGCAGCAG 
GTCGTCGTCG TCGTCGTCGT CGTCGTCGTC GTCGTCGTCG TC6TCGTCGT CGTCGTCGTC 


EllO E120 E130 

CAGCAGCAAG AGACTAGCCC CAGGCAGCAG 
GTCGTCGTTC TCTGATCGGG GTCCGTCGTC 


£140 E150 2160 

CAGCAGCAGC AGGGTGAGGA TGGTTCTCCC 
GTCGTCGTCG TCCCACTCCT ACCAAGAGGG 


P 3 70 El BO E190 EEOO ££l6 EEEO 

CAAGCCCATC GTAGAGGCCC CACAGGCTAC CTGGTCCTGG ATGAGGAACA GCAACCTTCA 

G1TCG3GTAG CATCTCCGGG GTGTCCGATG GACCAGGACC TACTCCTTGT CGTTGGAAG » 

EE30 ££40 ££50 EE60 ££70 E2SO 

CAGCCGCAGT CGGCCCTGGA GTGCCACCCC GAGAGA6GTT GCGTCCCAGA GCCTGGAGCC 

GTCGGCGTCA GCCGGGACCT CACGGTGGGG CTCTCTCCAA CGCAGGGTCT CGGACCTCGt? 

E£90 £300 E31 O E3EO £330 £340 

GCC5TG5CCG CCAGCAAGGG GCTGCCGCAG CAGCTGCCAG CACCTCCGGA CGAGGATGAC 

CBSCACCLV^C GGTCGTTCCC CGACGBCGTC GTCGACGGTC GTGGAGGCCT GCTCCTACTG 

£350 E360 £370 S3BO . 2390 £400 

TCAGCTSCCC CATCCACGTT GTCCCTGCTG GGCCCCACTT TCCCCGGCTT AAGCAGCTGC 

AGTCGAC6GG GTAGGTGCAA CAGGGACGAC CCGGGGTGAA AGGGGCCGAA TTCGTCGACG 

£4 10 E4EO £430 £440 £450 £460 

TCCGCTGACC TTAAAGACAT CCTGAGCGAG GCCAGCACCA T6CAACTCCT TCAGCAACAG 

AGGCGACTG3 AATTTCTGTA GGACTCGCTC CGGTCGTGGT ACGTTGAGGA AGTCGTTGTC 

E'47C> E4BO £490 E5O0 £510 E5EO 

CAGCAGGAAG CAGTATCCGA AGGCAGCA6C AGCGGGAGAG CGAGGGAGGC CTC6GGGGCT 

GTCGTCCTTC GTCATAGGCT TCCGTCGTCG TCGCCCTCTC GCTCCCTCCG GAGCCCCCGA 

E530 £540 £550 E560 E570 E5BO 

CCCACTTCCT CCAAGGACAA TTACTTAGGG GGCACTTCGA CCATTTCTGA CAACGCCAAG 

GGGTGAAGGA GGTTCCTGTT AATGAATCCC CCGTGAAGCT GGTAAAGACT GTTGCGGTTC 

E590 E600 £610 £6£0 £630 £640 

GAGTTGTGTA AGGCAGTGTC GGTGTCCATG GGCCTGGGTG TGGAGGCGTT GGAGCATCTG 

CTCAACACAT TCCGTCACAG CCACAGGTAC CCGGACCCAC ACCTCCGCAA CCTCGTAGAC 

£650 E660 E670 E6B0 E690 E70O 

AGTCCAGGGG AACAGCTTCG GGGGGATTGC ATGTACGCCC CACTTTTGGG AGTTCCACCC 

TCAGliTCCCC TTGTCGAAGC CCCCCTAACG TACAT6CGGG GTGAAAACCC TCAAGGTGGG 


GCTGT 
CGACAC 


avjO 8720 2730 27<K> £750 B7bO : ^ r ^»^g; 

G^CTC CCACTCCTTB TGCCCCATTG 6CCGAATGCA AAGGTTCTCT GCT AG ACGAC (. ^ji?'. T%E fc ."" 
>CCGAG GGTGAGGAAC ACGGGGTAAC CGGCTTACGT TTCCAAGAGA CGATCT6CTG v".? "'^^s - 


2770 27BO 2790 2BOO 28 10 BBBO^ 

AGCGCAGGCA AGAGCACTGA AGATACTGCT GAGTATTCCC CTTTCAAGGG AGGTTACACC, 
TCGCGTCCGT TCTCGTGACT TCTATGACGA CTCATAAGGG GAAAGTTCCC TCCAATBTBQ J^,- 

2B3<;! 28<*0 2850 2B6C> 2870 ^B8O"|^0':* ; IP^ : v ; ^^^ 

AAAGGGCTAG* AAGSCGAGAG CCTAGGCTGC TCTGGCAGCG CTGCAGCAGG GAGCTCCGQB.^: jjj^i j&^ift; 
n iCCCGATC TTCCGCTCTC GGATCCGACG AGACCGTCGC GACGTCGTCC CTCSAteCt^^^XST v 332p 

-^-^^^^^ 
£390 2 0 00 E910 2920 2930 - H940 £r;S-5/^/iS 

AC^CTTGAAC TGCCGTCT AC CCTGTCTCTC TACAAGTCCG GAGCACTGGA CGAGGCAGCT^f^&^^>^t 
TGT '.3AACTTG ACGGCAGAT G GGACAGAGAG ATGTTCAGGC CTCGTGACCT GCTCCGTCGA^.: 

P950 2960 2970 H9B0 2990 - 3000^--"';\^^..*,^^rS 

RCG"< AIXA6A GTCGCSACTA CTACAACTTT CCACTGGCTC TGGCCGGACC BCCGCCCDCT^ 
CGJ^.TGGTCT CAGCGCTGAT GATGTTGAAA GGTGACCGAG ACCGBCCTGG CGGCGGGGGA ; \. ^—V.- 

3010 3020 3030 30<f0 3050 3060 .v : 

CC5CCGCCTC CCCATCCCCA CGCTCGCATC AAGCTGGAGA ACCCGCTGGA CT ACGGCAGC . ~ ^/ 
GGCGGCGGAG GGGTAGGGGT GCGAGCGTAG TTCGACCTCT TGGGCGACCT GAT6CCGTCG ^t*; ^ -1:4^ 

7: o 7 O 3 0 9 O 3 09 O 3 1 O O 3 X 1 0 31 20, , ^ ± ir ^ ; *_ ^ ,43^. . 

GCn GL-GCGG CT*££pGCGGC C^^\JTGCC6C T^TGGGGACC TGGCGAGCCT GCATGGCGCG " f '-X? 
CGGACC-CGCC GAfGjCCGCCG CO^ACGGQfr ArACCCCT GG ACCGCTCGGA CGTACC6CGC - V. -j^"* 

" :'*'■■ ^ : ^fe* 

3130 3140 3150 3360 3170 3180 V ■ . ' : - * 

GC-TGCtrsC-GG GACCCGGTT C TGGGTCACCC TCAGCCGCCG CTTCCTCATC CTGGCACACT. . , t- - . f.': y r 

-rf;CAI pTt:GCC CTGGGCCAAG ACCCAGTGGG AGTCGGCGGC GAAGGAGJAG GACCGTGTG A ' ' >r : 

3J90 3200 321 0 3220 3230 3240 " '? ' '<;.\ 

CTCTTCOCAG CCGAAGAAGG CCAGTTGTAT GGACCGTGTG GTGGTGGTGG GGGTGGTGBC ? * * '\ ^ >v ;- : 

GACv^AGTGTC GGCTTCTTCC GGTCAACATA CCTGGCACAC CACCACCACC CCCACCACCG \ r«rS- •> 

3550 32*0 3270 3280 3290 3300 ^ ' * y 

GGCGGCGGCG GCGGCGGCGG CGGCGGCGGC GGCGGC6GCG GCGGCGGCGG CGAGGCGGGA 
Cr;Gr.c.Gf".r # GC CGCCGCCGCC GCCGCCGCCG CCGCCGCCGC CGCCGCCGCC GCTCCGCCCT / 

3310 3320 3330 33*40 3350 3360 

GOnTftGCCC CnACGGCTA CACTCGGCCC CCTCAGGGGC TGGCGGGCCA GG A A AGCGAC t - " kl^ 

CGTiCAl CGGG GGA'T GCCGAT GTGAGCCGGG GGAGTCCCCG ACCGCCCGGT CCTTTCGCTG . I\ 

* " s * * r ' - - - ":■*' ; 

^37M 33BO 3390 3^00 3^10 3^20 . \ .* ■ "'i/jr 

T7 C ACCGCAG CTGATGTGTG GTACCCTGGC GGCATGGTGA GCAGAGTGCC -CTATCCCAGT : . 5 . J -V." , 

AAGTGGCGlR G ACT AC At; AC CATGGGACCG CCGTACCACT C6TCTCACGG GAT AGGGTCA ■* - " - ' 

3^30 3»^0 3^50 3^60 3^70 3^BO'' 

Crcr.fJTGTG TCA^>AAGCGA AATGGGCCCC TGGATGGATA GCTACTCCCG GGAACCTTAC 
GGGTGAACAC AGT TTTCGCT TTACCCGGGG ACCTACCTAT CGATGAGGGC CCTTGGAATG - „ •'- 

3A70 3500 3510 3520 3530 35<fO ~ ' ^i'^ '■■ 

FGGGACA7GC GTTTGGAGAC TGCCAGGGAC CATGTTTTGC CCATTGACTA TTACTTTCCA " vj:% 

CCCCT51ACP CAAACCTC1G .ACGGTCCCTG GTACAAAACG GGTAACTGAT AATGAAAGGT * -7;" 

3550 3560 3570 35BO 3590 360O 

CCr.L^GAAGA CCTGCCTGAT CTGTGGAGAT GAAGCTTCTG GGT6TCACTA TGGAGCTCTC 
GGGGTCTTCT GGACGGACTA GACACCTCTA CTTCGAAGAC CCACAGT6AT ACCTCGAGAG 


36 i 0 

3620 

3630 

3640 

3650 

3660 

ACATGTGGAA 

GCTGCAAGGT 

CTTCTTCAAA 

AGAGCCGCTG 

AAGGGAAACA 

GAAGTACCTG 

TGTACACCTT 

CGACGTTCCA 

GAAGAAGTTT 

TCTCGGCGAC 

TTCCCTTTGT 

CTTCATGGAC 

3670 

36B0 

3690 

3700 

3710 

3720 

TGCGCCAGCA 

GAAATGATTG 

CACTATTGAT 

AAATTCCGAA 

GGAAAAATTG 

TCCATCTTGT 

ACGCGGTCGT 

CTTTACTAAC 

GTGATAACTA 

TTTAAGGCTT 

CCTTTTTAAC 

AGGTAGAACA 

3730 

374-0 

3750 

3760 

3770 

3780 

CGTCTTCGGA 

AATGTTATGA 

AGCAGGGATG 

ACTCTGGGAG 

CCCGGAAGCT 

GAAGAAACTT 

GCAGAAGCCT 

TTACAATACT 

TCGTCCCTAC 

TGAGACCCTC 

GGGCCTTCGA 

CTTCTTTGAA 

37 90 

3800 

3810 

3820 

3830 

3840 

GGTAATCTGA 

AACTACAGGA 

GGAAGGAGAG 

GCTTCCAGCA 

CCACCAGCCC 

CACTGAGGAG 

CC ATT AG ACT 

TTGATGTCCT 

CCTTCCTCTC 

CGAAGGTCGT 

GGTGGTCGGG 

GTGACTCCTC 

3850 

3860 

3870 

38BO 

3890 

3900 

ACAACCCAGA 

AGCTGACAGT 

6TCACACATT 

GAAGGCTATG 

AATGTCAGCC 

CATCTTTCTG 

TGTTGGGTCT 

TCGACTGTCA 

CAGTGTGTAA 

CTTCCGATAC 

TTACAGTCGG 

GTAGAAAGAC 

39 lO 

3920 

3930 

3940 

3950 

3960 

AATGTCCTGG 

AAGCCATTGA 

GCCAGGTGTA 

GTGT6TGCTG 

GACACGACAA 

CAACCAGCCC 

TTACAGGACC 

TTCGGTAACT 

CGGTCCACAT 

CACACACGAC 

CTGTGCTGTT 

GTTGGTCGGG 


3970 

3980 

3990 

4000 

-4010 

A020 

GACTCCTTTG 

CAGCCTTGCT 

CTCTAGCCTC 

AATGAACTGG 

GAGAGAGACA 

GCTTGTACAC 

CTGAGGAAAC 

GTCGGAACGA 

GAGATCGGAG 

TTACTTGACC 

CTCTCTCTGT 

CGAACATGTG 

4030 

4040 

* 4050 

4O60 

4070 

4080 

GTGGTCAAGT 

GGGCCAAGGG 

CTTGCCTGGC 

TTCCGCAACT 

TACACGTGGA 

CGACCAGATG 

CACCAGTTCA 

CCCGGTTCCC 

GAACGGACCG 

AAGGCGTTGA 

ATGTGCACCT 

GCTGGTCTAC 

4090 

4100 

41 lO 

^lSO 

4130 

4140 

GCTGTCATTC 

AGTACTCCTG 

GATGGGGCTC 

ATGGTGTTTG 

CCATGGGGTG 

GCGATCCTTC 

CGACAGTAAG 

TCATGAGGAC 

CTACCCCGAG 

TACCACAAAC 

GGTACCCGAC 

CGCTAGGAAG 

4150 

4160 

4170 

*flBO 

4190 

4200 

ACCAATGTCA 

ACTCCAGGAT 

GCTCTACTTC 

GCCCCTGATC 

TGGTTTTCAA 

TGAGTACCGC 

TGGTTACAGT 

TGAGGTCCTA 

CGAGATGAAG 

CGGGGACTAG 

ACCAAAAGTT 

ACTCATGGCG 

4210 

4220 

4230 

*t£40 

42 SO 

4260 

ATGCACAAGT 

CCCGGATGTA 

CAGCCAGTGT 

GTCCGAATGA 

GGCACCTCTC 

TCAAGAGTTT 

TACGTGTTCA 

GGGCCTACAT 

GTCGGTCACA 

CAGGCTTACT 

CCGTGGAGAG 

AGTTCTCAAA 

4270 

4280 

4290 

430O 

4310 

4320 

GGATGGCTCC 

AAATCACCCC 

CCAGGAATTC 

CTGTGCATGA 

AAGCACT6CT 

ACTCTTCAGC 

CCTACCGAGG 

TTTAGTGGGG 

GGTCCTTAAG 

GACACGTACT 

TTCGTGACGA 

TGAGAAGTCG 

4330 

4340 

4350 

4360 

**370 

43BO 

ATTATTCCAG 

TGGATGGGCT 

GAAAAATCAA 

AAATTCTTTG 

ATGAACTTCG 

AAT6AACTAC 

TAATAAGGTC 

ACCTACCCGA 

CTTTTTAGTT 

TTTAAGAAAC 

TACTTGAAGC 

TTACTTGATG 

4390 

4400 

4410 

^42C> 

4430 

4440 

ATCAAGGAAC 

TCGATCGTAT 

CATTGCATGC 

AAAAGAAAAA 

ATCCCACATC 

CTGCTCAAGA 

TAGTTCCTTG 

AGCTAGCATA 

GTAACGTACG 

TTTTCTTTTT 

TAGGGTGTAG 

GACGAGTTCT 


4450 4-460 4470 4^480 ^490 4500 

CGCTTCTACC AGCTCACCAA GCTCCTGGAC TCCGTGCAGC CTATTGCGAG AGAGCTGCAT 
GCGAAGATGG TCGAGTGGTT CGAGGACCTG AGGCACGTCG GATAACGCTC TCTCGACGTA 


^5 i o 45EO 4530 4540 4550 4560 

CAGTTCACTT TTGACCTGCT AATCAAGTCA CACATGGTGA GCGTGGACTT TCCG6AAATG 
GTCAAGTGAA AACTGGACGA TTAGTTCAGT GTGTACCACT CGCACCTGAA AGGCCTTTAC 

4570 4580 4590 4600 46 lO 46EO 

ATGGCAGAGA TCATCTCTGT GCAAGTGCCC AAGATCCTTT CTGGBAAAGT CAAGCCCATC 
TACCGTCTCT AGTAGAGACA CGTTCACGGG TTCTAGGAAA GACCCTTTCA GTTCGGGTAG 

A630 4640 4650 466C 4670 46B0 

TATTTCCACA CCCAGTGAAG CATTGGAAAC CCTATTTCCC CACCCCAGCT CATGCCCCCT 
ATAAAGG7GT GGGTCACTTC GTAACCTTTG GGATAAAGGG GTGGGGTCGA GTACGGGGGA 

4690 4700 47 lO 47£0 4730 4740 

TTCAGATGTC TTCTGCCTGT TATAACTCTG CACTACTCCT CTGCAGTGCC TTGGGGAATT 
AAGTCTACAG AAGACGGACA ATATTGAGAC GTGATGAGGA GACGTCACGG AACCCCTTAA 

A750 4760 4770 4780 4790 4800 

TCCTCTATTG ATGTACAGTC TGTCATGAAC ATGTTCCTGA ATTCTATTTG CTGGGCTTTT 
AGGAGATAAC TACATGTCAG ACAGTACTTG TACAAGGACT TAAGATAAAC GACCCGAAAA 

AGIO 4820 4830 48,40 4850 4B6G 

TTTTTCTCTT TCTCTCCTTT CTTTTTCTTC TTCCCTCCCT ATCTAACCCT CCCATGGCAC 
AAAAAGAGAA AGAGAGGAAA GAAAAAGAAG AAGGGAGGGA TAGATTGGGA GGGTACCGTG 

^S70 4BB0 4890 49DO 49 lO 49SO 

CTTCAGACTT TGCTTCCCAT TGTGGCTCCT ATCTGTGTTT TGAATGGTGT TGTATGCCTT 
GAAGTCTGAA ACGAAGGGTA ACACCGAGGA TAGACACAAA ACTTACCACA ACATACGGAA 

493r , ^940 4950 4960 4970 49BO 

TAAATCTGTG ATGATCCTCA TAT6GCCCAG TGTCAAGTTG TGCTTGTTTA CAGCACTACT 
* ATTTAGACAC TACTAGGAGT ATACCGGGTC ACAGTTCAAC ACGAACAAAT GTCGTGATGA 

z»990 5000 5010 5020 503O 5040 

CTGTGCCAGC CACACAAACG TTTACTTATC TTATGCCACG GGAAGTTTAG AGAGCTAAGA 
GACACGGTCG GTGTGTTTGC AAATGAATAG AATACGGTGC CCTTCAAATC TCTCGAT . CT 

5050 5060 5070 50B0 

TTATCTGGGG AAATCAAAAC AAAAAACAAG CAAACAAAAA AAAAA 
AATAGACCCC TTTAGTTTTG TTTTTTGTTC GTTTGTTTTT TTTTT 



; F/GUKET S {pa<$* l of 

10 20 30 ^O 50 60 

GAGCTCTGGA CAAAATTGAG CGCCTATGTG TACATGGCAA GTGTTTTTAG TGTTTGTGTG 

70 BO 90 lOO HO 120 

TTTACCTGCT TGTCTGGGTG ATTTTGCCTT TGAGAGTCTG GATGAGAAAT GCATGGTTAA 

1 30 1 *fO 1 50 160 1 70 1 BO 

AGGCAATTCC AGACAGGAAG AAAGGCAGAG AAGAGGGTAQ AAATGACCTC TGATTCTTGG 

190 200 210 220 230 240 

GGCTGAGGGT TCCTAGAGCA AATGGCACAA TGCCACGAGG CCCGATCTAT CCCTATGACG 

£50 260 270 280 H90 30O 

GAACTCTAAG GTTTCAGCAT CAGCTATCTG CTGGCTTGGT CACTGGCTTG CCTCCTCAGT 


310 320 330 340 350 360 

TTGTAGGAGA CTCTCCCACT CTCCCATCT6 CGCGCTCTTA TCA6TCCTGA AAAGAACCCN 

370 3BO 390 400 *f 10 420 

TGGCNAGCCA GGAGCNAG6T ATTCNTATCG TCCTTTTCNT CCTCCTNGCC TCACCTNGTT 


430 **£*0 450 460 ^70 ^80 

GNTTTTTAGA TTGGNCTTNG NAACCAAATT TGTATGCTGG CCTCCA6GAA ATCTGGAGCC 

490 500 510 520 530 540 

TGGCGCCTAA ACCTTGGTTT AGGAAAGCAG GAGCTATTCA GGAAGCAGGG TCCTCCAGGG 

550 560 570 5B0 590 600 

CTAGAGCTAG CCTCTCCTGC CCTCGCCCAC GTGCGCCAGC ACTTGTTTCT CCAAAGCNAC 

610 620 630 640 650 660 

^TAGGCAGGCG TTAGCGCGCG GTGAGBGGAG GGGAGAAAAG GAAAGGGGAG GGJ3AGGGAAA 

670 680 690 700 710 720 

AGGA(3GTGGG AAGGCAAGGA GGCCGGCCNG GTGGGGGCGG GACCCGACTC GCANNAACTG 

730 740 750 760 770 7BO 

TTGCATTTGC TCTCCACCTC CCAGCGCCCC CTCCGAGATC CCGGGGAGCC AGCTTGCTGG 

790 800 810 820 830 840 

GAGAGCGGGA ACGGTCCGGA GCAAGCCCAG AGGCAGAGGA GGCGACAGAG GGAAAAAGGG 

850 . B60 870 880 890 900 

CCCNAGCTAG CCGCTCCAGT GCTGTACAGN AGCCGAAGGA CGCACCACGC CAGCCCCAGC 

91 0 920 930 940 95G 960 

CCGGCTCCAG CGACAGCNAA CGCCTCTTGC ANGCGTTCGA AGCCGCCGCC CGGAGCTGCC 

970 980 990 lOOO 1010 1020 

CTTTCCTCTT CGGTGAAGTT TTTAAAAGCT GCTAAAGACT CGGAGGAAGC AAGGAAAGTG 

1030 104O 1050 1060 1070 lOBO 

CCTGGTAGGA CTGACGGCTG CCTTTGTCCT CCTCCTCTCC ACCCCGCCTC CCCCCACCCT 

1090 1100 1110 1120 1130 1140 

GCCTTCCCCC CCTCCCCCGT CTTCTCTCCC GCAGCTGCCT CAGTCGGCTA CTCTCAGCCA 

1150 1160 1170 1180 1190 1200 

ACCCCCCTCA CCACCCTTCT CCCCACCCGC CCCCCCGCCC CCGTCGGCCC AGGGNT6NCA 

1210 . . . .... 1220 v -1230 1240 1250 . 1260 : - 

GNCCGAGTTT GCAGAGAGGT AACTCCCTTT : VEGCTGCGAGC GGGCGAGNCT ABCTGCAC&T^^^ 

........ LuL. -'t i 




1 e7o 

1S80 

1290 

1300 

1310 

1380 


TGCAAAGAAG 

GCTCTTAGGA 

GCAGGCGACT 

GGGGAGCGGC 

TTCAGCACTG 

CAGCCACGAC 


1 330 

134 0 

1350 

1360 

1370 

13BO 


CNGCCTGGTT 

AGGCTGCACG 

CGGAGAGAAC 

CCTCTGTTTT 

CCCCCACTCT 

CTCTCCACCT 


i 390 

1400 

1410 

1420 

1430 

1440 


CCTCCTGCCT 

TCCCCACCCC 

GAGTGCGGAG 

CCAGAGATCA 

AAAGATGAAA 

AGGCAGTCAG 


1 <4 ^0 

1 4 60 

3 470 

14 GO 

1490 

1 500 


GTCTTCAGTA 

GCCAAAAAAC 

AAAACAAACA 

'aaaacaaaaa 

AGCCGAAATA 

A A AG AAA AAG 


* 5 1 0 

1 580 

1530 

154 0 

1550 

1560 


AT AATAACTC 

AGTTCTTATT 

TGCACCTACT 

TCAGTGGACA 

CTGAATTT6G 

AAGGTGGAGG 


1570 

15B0 

1 590 

1600 

161C 

-1620 


ATTTTGTTTT 

TTTCTTTTAA 

GATCTGGGCA 

TCTTTTGAAT 

CTACCCTTCA 

AGTATTAAGA 


1 630 

1640 

1650 

1660 

1670 

16BO 


GACAGACTGT 

GAGCCTAGCA 

GGGCAGATCT 

TGTCCACCGT 

GTGTCTTCTT 

CTGCACGAGA 


A c:90 

1 700 

1 7 1 0 

1 7.20 

1730 

174D 


CTTTGAGGCT 

GTCA6AGCGC 

TTTTTGCGTG 

GTT6CTCCCG 

CAAGTTTCCT 

1 CTCTGGAGC 


1 *?50 

1760 

1770 

17SO 

1790 

1 BOO 


TTCCCGCAGG 

TGGGCAGCTA 

GCTGCAGCGA 

CTACCGCATC 

ATCACAGCCT 

GTTGAACTCT 


3 & 1 0 

1820 

1 830 

1 840 

1B50 

1 860 

5 

TC'TfSAGCAAG 

AGAAGGGGAG 

GCGGGGT AAG 

GGAAGTAGGT 

GGAAGATTCA 

GCCAA3CTCA 


1890 j r Z :;. 

6_Gu AT 5 GAA GTG CAG TTA GGG CTG GGA AGG GTC TAG CCT CGG CCG CCG TCC AAG ACC T*-C 
<Et Glu Vai Gin Leu Gly Leu Gly Arg Va 1 Tyr Pro Arg Pro Pro 9er Lys Thr Tyr 

1950 lVcO 
CG.'- ftA GCT TTU CAG AAT CTG TTC CAG AGC GTG CGC GAA GTG ATC CAG A AC CCG GG~ CCC 
Arg Gly Ala Phe Gin Asn Leu Phe Gin Eer Val Arg Glu Val lie Gin Asn Pro Gly Fro 

20 lO 2<.«*0 
AG3 CAC CCA GAG GCC GC6 AGC GCAG^> CCT CCC GGC GCC AGT TTG CTG CTG CTG CAG CAG 
Arg His Pro Glu Ala Ala Ser Ala^]^ Pro Pro Gly Ale. Ber Leu Leu Leu Leu Gin Gin 

807C 2lOO 
CAS CAG CAG CAG CAG CAG CAG CAG CAG CAG CAG CAG CAG CAG CAG CAG CAG CAG CAG CAG 
Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin GlnGln Gin Gin Gin 

2130 E160 
CAG CAG CAA GAG ACT AGC CCC AGG CAG CAG CAG CAG CAG CAG GGT GAG GAT GGT TCT CCC 
Gin Gin Gin Glu Thr Ser Pro Arg Gin Gin Gin Gin Gin Gin Gly Glu Asp Gly Ser Pro 

£190 2220 
CAA GCC CAT CGT AGA GGC CCC ACA GGC TAC CTG GTC CTG GAT GAG GAA CAG CAA CCT TCA 
Gin Ala His Arg Arg Gly Pro Thr Gly Tyr Leu Vsl Leu Asp Glu Glu Gin Gin Pro Ser 

2250 8880 
CAG CCG CAG TCG GCC CTG GAG TGC CAC CCC GAG AGA GGT TGC GTC CCA GAG CCT GGA GCC 
G3n Pro Gin Ser Ala Leu Glu Cys His Pro Glu Arg Gly Cys Val Pro Glu Pro Gly Ala 

2310 2340 
GCC GTG GCC GCC AGC AAG GGG CTG CCG CAG CAG CTG CCA GCA CCT CCG GAC GAG GAT GAC 
Ale Val Ala Ala Ser Lys Gly Leu Pro Gin Gin Leu Pro Ala Pro Pro Asp Glu Asp Asp 

• ■ - "V. ■ 


if i\ Gn t-t.-r c.ca icr acg ttg ut. ctg ctg ggc ccc act ttc ccc ggc tta agc agc tgc 
A),-' Ala Fit Snr Thr Lfli Ser Leu l.pu Gly Pro Ihr Phe Pro Gly Leu Ber Ser Cys 

P'»3'i 24 60, 

t« r nrr gap ctt aaa cac atc ctg agc gag gcc agc acc atg caa ctc ctt cag caa cag 

br , A]? Arvp Lpu Lys Arrp lie Leu Ser Glu Ale Ser Thr Met Gin Leu Leu Gin Gin Gin 

f?'4 9o 2520 

t f-.f, i..r»ri raa (51 A G"!A ICC GAA GGC AGC AGC AGC GGG AGA GCG AGG GAG GCC TCG GGG GCT 
Gin Gin Glu A3? VfI ?pr Glu Gly St?r Ser Ser Gly Arg Ala Arg Glu Ala Ser Gly Ala 

2580 

r* r ATT ICC TCC A AG GAC- AAT T AC TTA GGG GGC ACT TCG ACC ATT TCT GAC AAC GCC AAG 
Pi i« ihr ; *r-r Scr l y& A«^p Asm Tyr Leu Gly Gly Ihr Ser Thr lie Ser Asp Asn Ala Lys 

TAJ O 2640 
: if; in-] mA^ r,r^ rip. TI G GIG TCC ATG GGC CTG GGT GTG GAG GCG TTG GAG CAT CTG 
p I'" f. ys L Al,- V-'al Ser V?l Sei Mft Gly Leu Gly Val Glu Ala Leu Glu His Leu 

"I* «?/.«70 2700 

4» AG I CCA GGG GAA CAG CI T CGG GGG GAT TGC ATG TAC GCC CCA CTT TTG GGA GTT CCA CCC 
yj. Sr-'- l-n.- Gly Glu Gin Leu Arg Gly A=p Cys Met Tyr Ale Pro Leu Leu Gly Val Pro Pro 

f*l F7?" 2760 

nr.! GIG GCT CCC ACT CCT TGT GCC CCA TIG GCC GAA TGC AAA GGT TCT CTG CTA GAC GAC 
lit M 1 VfiJ A3a F ' rCl ^ hr ^' rc ' c y- Pre Leu Ala Glu Cys Lys Gly Ser Leu Leu Asp Asp 

E 279A 2B20 

-r. g«;a ggc AAG agc act GAA gat act GCT gag TAT TCC CCT TTC AAG GGA GGT TAC acc 
~1 ] a GJ Y Lvs 5cr Thr Glu A=p Ihr Ala Glu Tyr Ser Pro Phe Lys Gly Gly Tyr Thr 

O 28B0 
[Jj A?V* G'Hl- CTA GAm GiC3L GAG AGC CTA GGC TGC TCT GGC AGC GCT GCA GCA GGG AGC TCC GGG 
Zl !. j-r. Glv I. f?u GIli Gly Glu 5ft Leu Gly Lys Ser Gly Ser Ala Ala Ala Gly Ser Ser Gly 

O P<? 1 O 2940 

M.'A C Tl R A A CTG CCG TCT ACC CTG TCT CTC T AC AAG TCC GGA GCA CTG GAC GAG GCA GCT 
Tt-.i I c:« niu I. en Fro ?pr Thr Leu Ser I.pli Tyr Lys Ser Gly Ala Leu Asp Glu Ala Ala 

8<?-7 0 3000 
GCG TAC CAG AGT CGC GAC TAC TAC AAC TTT CCA CTG GCT CTG GCC GGA CCG CCG CCC CCT 
Al-> Tyr Gin 5~r. Arq Asp T yr Tyr Asn Phe Pro Leu Ala Leu Ala Gly Pro Pro Pro Pro 

3030 30&0 
CU3 CCG CCT CCC CAT CCC CAC GCT t GC ATC AAG CTG GAG AAC CCG CTG GAC TAC GGC AGC 
Pi ' Pro Pro Prr- Hie Pre- His Ale Arg lie Lys Leu Glu Asn Pro Leu Asp Tyr Gly Ser 

30^c» 3120 
l-i/C TGG nc.G GCT Q£G GCG GCG CA<J-TGC cS^-CTTm r GGG GAC CTG GCG AGC CTG CAT GGC GCG 
Ai* T. p Ala Ala /Vb; Ala Ala G4*\ Cys Arqi^T" Gly Asp Leu Ala Ser Leu His Gly Ala 

3180 

GGT GCA GCG GGA CCC GGT TCT GGG TCA CCC TCA GCC GCC GCT TCC TCA TCC TGG CAC ACT 
H\y Ala AJfv Gly r>"r, Gly Ber Gly Sft Pro Ser Ala Ala Ala Ser Ser Ser Trp His Thr 

3?AO 324C 
TTC AC A GCC GAA GAA GGC CAG TTG TAT GGA CCG TGT GGT GGT GGT GGG GGT GGT GGC 
Lou Phe Thr Ala Glu Glu Gly" Gin Leu Tyr Gly Pro Cys Gly Gly Gly Gly Gly Gly Gl^ 


3E7C 3300 

GGC GGC GGC GGC GGC GGC GGC GGC GGC GGC GGC GGC GGC GGC GGC GGC GGC GAG GCG GGA 
Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Glu Ala Gly 

3330 3360 
GCT GTA GCC CCC TAG GGC TAG ACT CGG CCC CCT CAG GGG CTG GCG GGC CAG GAA AGC GAC 

Ala Val Ala Pro Tyr Gly Tyr Thr Arg Pro Pro Gin Gly Leu Ala Gly Gin Glu Ser Asp 

3390 34EO 

TTC ACC GCA CCT GAT GTG TGG TAC CCfr GGC- GGC "ATG GTG AGC AG A GTG CCC TAT CCC AGT 

Phe Thr Ala Pro Asp Val Trp Tyr Pro Gly Gly Met Val SeV Arg Val Pro Tyr Pro Ser 

3450 3^80 
CCC ACT TGT GTC AAA AGC GAA ATG GGC CCC TGG ATG GAT AGC TAC TCC CGG GAA CCT TAC 

Pro Thr Cys Val Lys Ser Glu Met Gly Pro Trp Met Asp Ser Tyr Ser Arg Glu Pro Tyr 

35 lO 3540 

GGG GAC ATG CGT TTG GAG ACT GCC AGG GAC CAT GTT TTG CCC ATT GAC TAT TAC TTT CCA 

Gly Asp Met Arg Leu Glu Thr Ala Arg Asp His Val Leu Pro lie Asp Tyr Tyr Phe Pro 

3570 360O 
CCC CAG AAG ACCTBC CTG ATC TGT GGA GAT GAA GCT TCT GGG TGT CAC TAT GGA GCT CTC 

Pro Gin Lys Thr Cys Leu lie Cys Gly Asp Glu Ala Ser Gly Cys His Tyr Gly Ala Leu 

3630 3660 

ACA TGT GGA AGC TGC AAG GTC TTC TTC AAA AGA GCC GCT GAA GGG AAA CAG AAG TAC CTG 

Thr Cys Gly Ser Cys Lys Val Phe Phe Lys Arg Ala Ala Glu Gly Lys Gin Lys Tyr Leu 

3690 37£C 

TGC GCC AGC AGA AAT GAT TGC ACT ATT GAT AAA TTC CGA AGG AAA AAT TGT CCA TCT TGT 

Cys Ala Ser Arg Asn Asp Cys Thr He Asp Lys Phe Arg Arg Lys Asn Cys- Pro Ser Cys 

375G 37BO 

CGT CTT CGG AAA TGT TAT GAA GCA GGG ATG ACT CTG GGA GCC CGG AAG CTG AAG AAA CTT 

Arg Leu Arg Lys Cys Tyr Glu Ala Gly Met Thr Leu Gly Ala Arg Lys Leu Lys Lys Leu 

38 lO 3SAO 

6GT AAT CTG AAA CTA CAG GAG GAA GGA GAG GCT TCC AGC ACC ACC AGC CCC ACT GAG GAG 

Gly Asn Leu Lys Leu Gin Glu Glu Gly Glu Ala Ser Ser Thr Thr Ser Pro Thr Glu Glu 

3B70 3900 

ACA ACC CAG AAG CTG ACA GTG TCA CAC ATT GAA GGC TAT GAA TGT CAG CCC ATC TTT CTG 

Thr Thr Gin Lys Leu Thr Val Ser His He Glu Gly Tyr Glu Cys Gin Pro He Phe Leu 

3930 3960 

AAT GTC CTG GAA GCC ATT GAG CCA GGT GTA GTG TGT GCT GGA CAC GAC AAC AAC CAG CCC 

Asn Val Leu Glu Ala lie Glu Pro Gly Val Val Cys Ala Gly His Asp Asn Asn Gin Pro 

3990 4020 

GAC TCC TTT GCA GCC TTG CTC TCT AGC CTC AAT GAA CTG GGA GAG AGA CAG CTT GTA CAC 

Asp Ser Phe Ala Ala Leu Leu Ser Ser Leu Asn Glu Leu Gly Glu Arg Gin Leu Val His 

4050 *OBO 

GTG GTC AAG TGG GCC AAG GGC TTG CCT GGC TTC CGC AAC TTA CAC GTG GAC GAC CAG ATG 

V*l Val Lys Trp Ala Lys Gly Leu Pro Gly Phe Arg Asn Leu His Val Asp Asp Gin Met 

GCT GTC ATT CAG TAC TCC TGG ATG GGG CTC ATG GTG TTT GCC ATG GGC TGG CGA TCC TTC 

Ala Val He Gin Tyr Ser Trp Met Gly Leu Met Val Phe Ala Met Gly Trp Arg Ser Phe 


F~t 61 u/e^r (page ^r) 


4170 4200 
ACC AAT GTC AAC TCC AGS ATG CTC TAC TTC GCC CCT GAT CTG GTT TTC AAT GAG TAC CGC 
Thr Asn Val Asn Ser Arg Met Leu Tyr Phe Ala Pro Asp Leu Val Phe Asn Glu Tyr Arg 

4230 4260 
ATG CAC AAG TCC CGG ATG TAC AGC CAG TGT GTC CGA ATG AGG CAC CTC TCT CAA GAG TTT 
Met His Lys Ser Arg Met Tyr Ser Gin Cys Val Arg Met Arg His Leu Ser Gin Glu Phe 

4290 . - - 4320 

GGA TGG CTC CAA ATC ACC CCC CAG GAA TTC CTG TGC ATG AAA GCA CTG CTA CTC TTC AGC 
Gly Trp Leu Gin lie Thr Pro Gin Glu Phe Leu Cys Met Lys Ala Leu Leu Leu Phe Ser 

4350 4380 
ATT ATT CCA GTG GAT GGG CTG AAA AAT CAA AAA TTC TTT GAT GAA CTT CGA ATG AAC TAC 
He He Pro Val Asp Gly Leu Lys Asn Gin Lys Phe Phe Asp Glu Leu Arg Met Asn Tyr 

44 10 4440 
ATC AAG GAA CTC GAT CGT ATC ATT GCA TGC AAA AGA AAA AAT CCC ACA TCC TGC TCA AGA 
He Lys Glu Leu Asp Arg lie He Ala Cys Lys Arg Lys Asn Pro Thr Ser Cys Ser Arg 

4470 4 5 Cm!) 

CGC TTC TAC CAG CTC ACC AAG CTC CTG GAC TCC GTG CAG CCT ATT GCG AGA GAG CTG CAT 
Arg Phe Tyr Gin Leu Thr Lys Leu Leu Asp Ser Val Gin Pro He Ala Arg Glu Leu His 

4530 4560 
CAG TTC ACT TTT GAC CTG CTA ATC AAG TCA CAC ATG GTG AGC GTG GAC TTT CCG GAA ATG 
Gin Phe Thr Phe Asp Leu Leu He Lys Ser His Met Val Ser Val Asp Phe Pro Glu Met 

4590 ■ 4620 

ATG GCA GAG ATC ATC TCT GTG CAA GTG CCC AAG ATC CTT TCT GGG AAA GTC AAG CCC ATC 
Met Ala Glu He He Ser Val Gin Val Pro Lys He Leu Ser Gly Lys Val Lys Pro He 

465C 4690 
TAT TTC CAC ACC CAG TGA AGC ATT GGA AAC CCT ATT TCC CCA CCC CAG CTC ATG CCC CCT 
Tyr Phe His Thr Gin End . . . 

4690 4700 4710 4720 4730 4740 

TTCAGATGTC TTCTGCCTGT TATAACTCTG CACTACTCCT CTGCAGTGCC TTGGGGAATT 

4750 4760 4770 47BO 4790 4BOO 

TCCTCT ATTG ATGTACAGTC TGTCATGAAC ATGTTCCTGA ATTCTATTTG CTGGGCTTTT 

4810 4B2C) 4830 4840 4850 4860 

TTTTTCTCTT TCTCTCCTTT CTTTTTCTTC TTCCCTCCCT ATCTAACCCT CCCATGGCAC 

4B70 4BBO 4890 4900 4910 4920 

CTTCAGACTT TGCTTCCCAT TGTGGCTCCT ATCTGTGTTT TGAATGGTGT TGTATGCCTT 

4930 4940 4950 4960 4970 4980 

TAAATCTGTG ATGATCCTCA TATGGCCCAG TGTCAAGTTG TGCTTGTTTA CAGCACTACT 

4990 5000 50 IO 5020 5030 5040 

CTGTGCCAGC CACACAAACG TTTACTTATC TTATGCCACG GGAAGTTTAG AGAGCTAAGA 

5050 5060 5070 5080 

TTATCTGGGG AAATCAAAAC AAAAAACAAG CAA AC AA AAA AAAAA 


210 3HO 330 34 O 350 360 

GCGGCT7CGAAGCCGCCGCCCAGAAGCTGCCCTTTCCTCT7CGGTGAAGTTTC7AAAAGC 


370 3SO 390 AGO HlO 4£0 

T GC GG G A G A C TC G 3 A GG A A G C G A AG A A A 8 78 7 CC GG TAG G A C T A C B ACTGCC7TT G T CC T 


iO £0 GO AO 50 60 

AATTCGGGAABGATCBAGCAAACCAGGAAAG7AAGGATGGAGATCCTAGGAGAGTGTCCA 

*70 60 fO 100 110 ISO 

T G C C T CG A A A GG A G C C C A CCA A AG A T G A A C T G 7 TGC A T T TGC T T T CC ACCT CCC AG C G CC 

130 1-+0 150 16L> 170 180 

C C C 7 C G 3 A G A T C C C T A G G A 3 C C A G C C T G C " G G G A G A A C C A G A 3 G G T C C G G A G C A A A C C G 

J«?0 BOO 210 Ec-0 EGO £40 

G A GG C7 G A GAG G G C A T C A 6 A GG GG A A A AG A C 7 G A G T T A GC C ACT CC A GT G CCA T AC A G A A 

Q 

'Zl 250 260 S70 SBO E90 30O 

4 s GCTTAAGGGACATACCACGCCAGCCCCAGCCCAGCGACAGCCAACGCCTGTTGCAGAGC-G 

PJ 

m 

ru 
~ 

O 
RJ 

f~% 4 30 4-4 O h50 460 470 4SO 

^ CGT-CCTCCTACCCCTACCCCTCCTGGGTCCCC7C:"CCC7GAGCGGAC-rAGGCAGGCTTC 

o 

O 490 500 53 0 520 530 . 540 

C T G GCC A GC C C T C T C CC CT AC A CCA C C A G C7 C T G CC A GC C AG T T 7 GC AC AGs A GG T A AC T C 

550 560 570 5SO 590 600 

CCTTT GGCTGAAAGCAGACGAGCTTGT7GCCCATTGGAAGGGAGGCTT 7TGGGAGCCCAG 

610 6E0 630 640 650 660 

AG A CTG AG G A G C A A C A G C A CGC 7G G AG A G7 C CC 7 G A T TCC AG G T T C 7 CCC CC C-T GC A C C 7 

670 680 690 700 710 720 

C C 7 A C T G C C C G C" C C C 1" C A C C C T G T G 7 G T G C A G C "TA G A A T 7 G A A A A G A T 3 A A A A G A C A G T T 

7 30 7^0 750 "60 77 0 790 

GGGGCT7CAG7AGTCGAAAGCAAAACAAAAGCAAAAAGAAAACAAAAAGAaAATAGCCCA 

790 GOO BIO 8c0 £3<* 3*0 

G T T CT T A T T T GC A CC 7 G C T T C AG7 G -3 AC A T TG AC T T T GG A A G G C A G A3 A A 7 7 7 TCC T TC C 


050 S60 B7C GGO B90 900 

CCCCAGTCAAGCTTTGAGCATCTTTTAATC7GTTCTTCAAG7A7T7AGGGACAAAC7G7G 


1 


9 10 vP(j 930 vhO 950 960 

A a £ : c 7 AGC & 3 £ GC A G A T C C T G 7 CT ft G CG CG T 3 C C T T CC 7 TT ft Cft GG A G A C T T 7 G A G 3CT A 


^70 980 990 1000 lOiO 1030 

TCTGGGCGCTCCCCCCCC7CCCTGCAAGTTT7CT7CCC7G6AGCTTCCCGCAGG7GGGCA 


10?0 iO?0 1.060 1070 1030 

G C "I A G C T 3 C A 3 A TACT A C A 7 C A; T C A G T C A G T A G A ft C 7 C 7 T C A G A G C A A S A G A C G A G G A B G 


1090 ] iOO IliO 1120 1120 1140 

C A G 3 A 7 A AG G G A A T T CG GT G G A AGC T AG AG AC A A G C 7 A A AG G A 7GG AG GTG C A 37 T AGG G 

ME-teiu^cO GlnLe»»G3 y 

j 1 50 1 1 60 1 1 70 1 3 GO 1 1 90 1 200 

CTGGGAAGGGTCTACCCACGGCCCCCGTCCAAGACCTATCGAGGAGCBTTCCftGAATCTG 
LeuGlyArgValTyrPrciAroPrc.FToSsrLy5ThrTyrArG61yAlflPheQilnAsnLBU 

iP]0 1E£0 1230 1240 1850 3 360 

T7CCAGAGCG7GCGCGAAGCGA7CCAGAACCCGGGCCCCAGGCACCC7GAGGCCGC7AGC 
Phe ! 3 1 nSer V ? 1 Ar o 8 J u A leUeGl nAsnPr o 3 3 yPr oAr qHi =>Pr o G I u A 1 a A 1 a Ber 

1270 1280 3 290 1 300 1310 1 320 

A T A GC A C C T C C" C G G T G C C T G TT T AC AGC A G C G G C A G G A C AC ^ A 5 C C C CC GG C GG'CGGCG B 
I leAlaProProG'J yft] s C v s L e u G 1 n G I n A r c* G 1 n 3 .1 uThrSer PrcArqAr ^ArgAro 

1330 1940 1250 1 3oO 1 370 1380 

CG G C AGC A GC ACC C T G A G S A 7 GG C 7C 7 CC T C A A G C C C A C A T C AG ft C 3 C A C C AC A 6 G C 7 A C 
firgGinGlriHiEPrcGluAEpBlySefProGlriAlaHi s-I 1 e?Av- gG 1 yThr 7 hv- G 1 yTyr 

i 390 1 400 1 4 1 0 1 4EO 3 h30 1 440 

C T GGCCC T GB AG GAG G A A C AG C AG C CTT C AC A G C A GC ABT C AG CCT CC G AG GGCCACCCT 
Leu A 1 aLeuG 1 uG 1 uG I uG 1 nG 1 nProSerGl nG 1 nG 1 nBer A I aSerG 1 u G I yH i sPr o 

1450 1460 1470 3.4BO 1490 1500 

GAGAGCGCCTGCCTCCCGGAGCCTGGAGCTGCCACGGCTCCTGGCAAGGGGC j GCCGCAG 
GlLiSErGlYCysLeuFToGluProGlyAlflAleThrAlaProGlyLysGiyLeuProGln 

151 O 3 530 1530 1540 3 550 _ _ 

C AGC C A C C AG CT C C 7 CC A G ATC AGG AT G AC T C AGCTGC C CC A7 C C ACG 7 T G TC C C T ACT 13 
G 1 nFrc Pre- A 1 aProFr o AspG 1 nAs-p Asp SerAl a A 1 aPr oSer Th™LeuSsr LeuLeu. 

1570 1530 1590 16 CO 1610 16EO 

G GC CC C AC T T T CC C A 3 G C T T A A GC A G C T G C TCC G C A G A C A T T A A AG A C A T CCT G AG CG A G 
GivProThrPh£PrDbIyt.euSerserCvs3e--A2aAt[; I lsLysAsp I leLeaSerGlu 

j 630 16^0 1650 1660 3 670 1680 

GCC33CAaCA7GCAACTTCTTCAGCAGCAGL;AGCAACA3"AACA^C:AGCAGCA3CABCAG 
A 2 c- G ] y Tr-r h-- 1 G I nl. euL euG 1 nG 3 nG 1 nG i nG 3 n3 1 nG 1 nG I nG 1 nG i n£ 1 nG 1 nG 1 n 

1690 1700 17iO 1720 1730 1740 

C A G C A G C A G C ■ : i G C A G C A A C A 3 C A G C A S G A G 3 T A A 7 ft T C C G A A G £ C A G C A G C A G C G T G A G A 
Gj nG:; v.G'irvB; nG 1 nG InGl nG lr.GinGI uVal 3 1 £■ S -h r G 3 u G 1 v S erSe r 9 .f r V & 1 fir g 

1750 1760 1770 1730 ■ 790 1300 

GCAAGGGAGGCCACT6GGGCTCCCTCTTCC7CCAAGGATAG "7 ACCT AGGGGGCAATTCG 
A 1 a A r g G i : j A I e 7 n r G 1 y A I a P r o £ e? r S e r 3 c? LysAs p S e r 7 y r L e u 8 I y G ] y AsnSer 
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iSjo 1320 1330 13-4 0 1S5 _' 

ACCATATC73ACAG7&CCAAGGAGTTGTGTAAAGCAGTG7CTGTGTCCA7GGG>o : TGt^T 
ThrlleSerAspSerAlrLysSiuLeuCysLysA) 1 Ser Va 1 Ser hett* I yLeuG 1 y 

18?:» 1S80 1S90 1900 391 °_ _ 1920 

GT G C- A A G C A C T GG A AC AT C T G A STC C AS BGG A GC AGC T T CG-3 BG CG A C 7 GU AT T ACSCG 
y a iG3 uAI sLeuEiuHi sLeuBer Pr oG 1 vG i u.G 1 nL e?u Ar Qd 1 yAspCyE^etTyrAla 

19^0 1950 3.960 1970 1930 

TCGCTCCTGeCAGG-TCCACCCGCCGTSCGTCCCACTCCTTGTGCGCCTCTGGCCBAAJBC 
Be-rLrpM.LeuG ] y G 3 yFroPr c« A i &V& 1 ArqProThr ProCysA 1 aProLeuA 1 aGl uuys 

i99o aooo 5010 2020 203c 

C.AAGGTCTTTCCCTGGACGAAGGCCCGGGCAAAGGCACTGAAGAGACTGCT&AGTA i Tsjl; 
LveG 1 vLeuSsrLeuAspS I uG ] yProGl yLysS 1 v7 hr Li 1 ub 1 uTh r A 1 *.G 1 uT yr Ser 

PD5C» 2060 2070 2030 209C 21 OO 

TCTTTCAAGGGAGGTTACGCCAAAGSGTTGGAAGGTGAGAGTCTGGGCTGCTCTGGCAliC 
Ser Pn&Ly sG 1 yG 1 yTyr A 1 s-LysG 3 vLeuGl uG 1 yG 1 ubfrrLeuGl yCysSer G j ySer 

£110 2 120 2130 2l**0 2150 2160 

A G T j3 A A GC- ^ GGT AG CT C T G G G A C AC T T G AG ATC CC G T C CT C A C T GT C T CT GT A T A A uT C T 
S e r G 1 Li A I <~ G 1 / S e rSerG 1 y ThrLeuGluIleP r oSe r S srLe u S.e r L euTyrLysBer 

2170 2 1 SO 2190 2200 2210 ^ 2220 

G G A G CAS T A G A C G A GGC A6C AG C A T AC C A 3 A A TC6CG A C T A C T A C A A C TT T CC \.i CT C G CT 
Rj Y Al£»y«l A=^pG3 uAU-.AiaA] . sTyrGlnABnArqAspTyrTvr AsnKhePrc-LeruAl a 

2230 EE'^0 8250 2260 2270 22SO 

CTGTC CCGCCCGCCGCACCCCCCGCCCCCT ACCCATCCACACGCCCGCATCAAECTGBAG 
LeuS!-TGlyPv-DPrcHiEPrc.ProPrc.PrcThrHisProF-:a&AlaArgI ielysLeuGlu 

2290 ■ 2300 2310 2320 2330 23<40 

AACCC^.TCGGACTACGGCAGCGCCTGGGCTGCGGCGGCAGCBCAATGCCGCTATGGGGAC 
AuTiPrcSsrAEpTyrG) ySerAl sTr p A 1 a A 1 aA 1 a A 3. a.Al eG 1 nCysAr gTy rtfl yAsp 

2350 2360 2370 2320 239C> 2<400 

TTGGCTAGCCTACATGGAGGGAGTGTAGCCGGACCCASCACTGGATCGCCCCCAGCCACC 
LeuAlaSerLeuHiBGlyGlySerValAlaGIyProSerThrGlyberProPrpAlaThr 

2h-10 2^-20 2A30 . 2h<4 0 2^50 2<460 

GCCTCTTCTTCCTGGCATACTCTCTTCACAGCTGAAGAAG3CCAATTATATGGG3CAGGA 
AlaSerSerSerTrpHiBThrLeuPheThrAiaGluGluGlyGlnLeu i yrGlyPrc-Gly 

2*>-70 24 80 2^9 0 2500 2510 2520 

C - G C G 3 G G G C G G C A G C A G T A G CC C A A G C G A T G C T G G B C C ~ G ~ A 3 C C C C C T A T G G C T A C A C T 
G 1 y Q 3 vG 1 yG } ySerSerSer Pr oSer Asp Al aS 1 yPrc-Va I AlaPr oTyrGly I yrThr 

2530 - 2540 2550 2560 2570 2530 

C G G Z C C C L T C A G G G B C T GG C A A G C C A G G A B G G 7 G A C T T C 7 C T G C C T C T" B A A G 1 3TSGTAT 
Ar TiPrr-ProGJ nG 1 y L «u A laSerG 1 r.3 1 u.G 3 yAspPhe -Sisr A 1 aSerD i 3 TrpTyr 

£:590 2600 2610 2620 8630 26*0 

CCTGGTGBAGTTGTGAACAGAGTCCCCTATCCCAGTCCCAnrTGTGTTAAAAGVCBAAATG 
PreG:. yGiyvaiVal A^nArcVa I ProTyrProSerPrcSsrCysVa 3 L ysSsrGl uMet 

26 50 2660 E670 2690 2690 E7HO 

G 3 A C CT T G GAT G S A G A A CT A CTC CGG A CC T T A T G GG G A C A TGCG T T T G G AC A 3 T AC C AGG 
G ) / P r o T r d M e- t G 1 u A s v- T y r 3 e? r G 1 y P r o 7 yr G 1 yAspMetA r gLeuA s p S er T h r Ar g 


1 7 


a^ ; i-ry £730 £730 Ef^O • E750 £760 

G AC C AC G ~ 7 7 7 ACC C A 7 CG AC 7 A 7 7 A C TTCC C A CC CC AG A A G A CC 7 C C C 7 G A 7 C T G T 3G A 
A = r > H 3 = a ... F r o I } e A soT y r T y r ' r> p r o P r o G j n L y s T h r C y ELei' I 1 e C/sG 1 y 

E77C> E7EO E790 ESOO 2810 EBEO 

GA7GAAGC 7 7C7GG77G7CAC7 ACGGAGC7C7CAC77B7GGCAGC7GCAAGG7C77C77C 
AsoGijAlaGerGl vCysHisTyrGlyAiaLeuThrCysb] yBerCysLvsVslPhePhe 

£830 EB40 £850 £860 E&70 EGSO 

A A A A G AG C T GCG G A AG GG A A AC AG A A S T AT C T A T G7 GC C A G C A G A A A 7 G A 7 7 G C ACC A 7 T 
L y s A f g A 1 a A 3. a G I u G I v L. y &G 1 n L y e T y r L u C y sAlsfce-rft r g AsnAs p C y s Th vile 

E890 £900 E910 E9EO E930 £94 0 

G AT A A A T TT CG G A GG A A A A A T T GT C CATC 67 GT CG T C-T CC GG A A A T GT T A T G A A G C AG GG 
A e p L v PI"- e A rqArgL y s A snCysP r d S er C y e- ArgLeuArgLySSy sT y r G 1 u A 2 a G 1 y 

8950 E960 £970 E9SO E990 3000 

A7GAC7C7G&6AGC7CG7AAGC7GAAGAAAC77GGAAA7C7CAAAC7ACAGGAAGAAGGA 
MetThr L = uGl yA 1 a ArgLy sLeuLy^LvsLeuG 1 yAsnLeuL ysLsuGlnGl u.GluGly 

30 X 0 30EO 3030 30*f O 3050 3060 

GftAAACTCCAGTGCTGGTAGCCCCACTGAGGACCCATCCCAGAAGATGACTGTATCACAC 
G J u. Aei-iSs r 3 e r A 1 a GlySerPr o T h r G JuAs p P r oSerGlnLysMetJhr V aiSerHis 

3070 30S0 3090 3 1 OO 3 1 1 O 31 EO 

A T TG A A GG CT A T G A A T GTC A AC CT AT CTT T C T 7 A A T G T CCT GG A AG C C A T T G AG CC A GG A 
7 I e F i li G 1 > T y r G 1 u C y & G 3 n F' roll e Ph eLeuAs n V alLe i_. G I u A 1 a I 1 e G 1 u P roG 1 y 

3130 3i^O 3150 3160 3170 3 1 SO 

G TBG T GT £3 7 GC CGG AC A7 G AC A AC A AC C A 6CC 7G A7 7 CC7 7 7 G C 7 G CC 7 7 G 7 7 A 7 C7 A 37 
ValV"s)CysAlaGlyHi5AspAsnA5nGlnPrc.A=p3erPhe?AlaAl5iSLiLeLtSsrSgr 

31 90 3EOO 3310 3EEO 3E30 3EAO 

C T C A A CG A 3 C 7 7GGCG AG A G A C AGC 7 7 G 7 A C A 7 G 7 G G 7 C A AG 7 GGGC C A AG GC C 7 7GCC 7 
L e u A = n G 1 Lt L f? u G I y G 1 u Ar q G 1 n L e u V a 1 H i s V a 1 V a 1 L y sTrpAlaLysAIaLe u P r c 

3350 3B6G 3E7C 3E90 ■ 3H90 3300 

GGCTTC:CGCAACTTGCATGTGGATGACCAGATGGCAGTCATTCAGTAT~CCT5GATGGGA 
G3 yPhsArgAsriLeuHisVal AspAspGlnMetAlaVsl X ] eGlnTyrEerTrpM^tGiy 

3310 33 SO 3330 33'-! O 3350 3360 

CT G A 7 GG T A T T T GC C A TG GG T T G G C GG T C C T T C AC T A A T G T C A AC TC T AGG A T G CT CT AC 
L e u !i = t V a. J P h e A 1 = M s t- G 1 v T r p A r g 9 g- r P h f* T h r A s *-> V a 1 A s n S e r Ar g M c- 1 L c? u T v r 

3370 3380 3390 3«+00 3h10 34EO 

TTTGCACCTGACCTGGTT77CAA7GAGTATCGCA7GCACAA6TCTCGAAT6TACAGCCAG 
PhsAI ?Prc.A5pLeuVfIPhE-AEnGU7vTArqM2?tHi5Ly£Ser Ar c Ms t Tyr SerGl n 

-3**30 3-f-iO 3^50 3^oO 347':- 3^ SO 

T & Z G 1* £ A G G h T' G A 3 G C- A C C T T T C T C A A G A £ 7 T T GG A T G G C T C C A G A T & A C C C C C C A G G A A 
C y s I A r q K; r- t Ar g H i s !. . &u E er G 1 n G .1 u Ph eG i y T r p L » l» G 1 n I ■ e T h r P r o G 1 nG 1 li 

3^90 3500 3510 35BO 3530 35^0 

TTCCTGTGCATGAAAGCACTGCTACTCTTCAGCATTATTCCAGTGGATGGGCTGAAAAAT 
Phc-LcuCyEhetLysAi nLeuLeuLenPheSsr I 1 e 1 1 eProVa 1 AspG i yL enLysAsn 

3550 3560 3570 35G0 3590 36O0 

C- A A A A A 1* T C TT T GAT 6 A A C T T C G A A 7 G A A C T A C A T C A A G G A A C T T G A 7 C G C A T C A T T G C A 
G 1 i"i L y e i"( c- F" h e A h p C luLe u A r gMst A s n 7 v rll eLy e-*G1 ut.e u s p A rcil leHeAla 
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3610 3630 3630 3640 3650 3660 

TGCAAAAG^AAAAATCCCACATCCTGCTCAAGGCGCTTCTACCAGC^ 

C y sr. L v ~- A r g L y sAs n P r oThrSe r C y ^ SerArg A r a F* h eT y r G 1 r f L e u ~ r : r Ly=LeuLe ll 

36"0 3680 3690 3700 37 J 0 3780 

i3ATTCTGTGCA»3CCTATTGCAAGA3A3CTGCATCAATTCACTTTToACCTGCTAATCAAG 
As p SsrVa 1 G I r Pr o I 1 sA ] aArgS 1 uLeuH i sG 3 nPheTnrPheAspLeuLeu J 1 eLys 

3730 3740 3750 , 3760 3770 3780 

TCwCATATG3TGAGCGTGGACTTTCCTGAAATGATG3CAGAGATCATCTCTGTGCAAGTG 
SerH sMEtVe 1 SerVs 1 ftspPhePrc-63 uMetMetAJ. aG 1 u 3 1 e I 3 eSerVs 1 G 1 nVs 1 

3790 3BOO 3S10 3880 3830 3S40 

C CC A AG A T C C T T T C T G3 G A A AG TC A GC CC A T G T A T T T C C: AC AC A C AGT G A A G A T T T GC?. A A 
Pr o !_ ■/=. I leLenSerG j yLysValSerProCy?! leSerThrHi E-Be>" G 1 u AspLenB 3 u 

3S50 3S60 3B70 33B0 3S90 3900 

CC7AA7ACCCAAACCCACCTGTTCCCTTTTCAGATGTCTTCTGCCTGTTATATAACTCTG 
Pr o AsnThr 31 nThr H i £.LeuPhePr oPheG 1 nMetberBer A 1 aCys'Tyr I i trThr Leu. 

391C 39E0 3930 3940 3950 3960 

CAC7ACTTCTCTGGCATGGGCCTTGGGGGAAATTCCTCTACTGATGTACAGTCTGTCATG 
HisTyrPheSerGi yMetGl yLeuG I y GI yfisnEerSer ThrAspVal G 1 nSerVa 1 Met 

3970 39 SO 3990 4000 4010 4030 

AACATGTTCllCAAGTTCTATTTCCTGGGCTTTTCCTTCTTTCTTTTTCTTCTTCTCTGC 
AsnMe tPheProLysPheT yrPheLeuG 1 yPheSer F hePneLeLiPheLeuLeiiLeuCys 

4030 4040 4050 4060 4070 . 40 SO 

ctcttttaccctcccatggcacattttgaatccgctgcgtgttgtggctcctgcctgtgt 

Leu Ph e "I" y r Pr o P r o M e t A 1 a H i EPheG i u SerAlaA i aCysCysG 1 ySe r Cy sLeuCys 

4090 4 1 00 4 110 ^ 1 30 41 30 4 1 40 

TTTGAGT7TTGTTGTATTTCTTCAAGTCTGTGATGATCTTCTTGTGGCCCAGT6TCAACT 
P h e G 1 1. j P h C y s=. Cys 1 1 eSerB ~- r SerLe u E n d 

4 3 50 4 1 60 4 1 70 4 1 SO 4 i 90 4300 

G T GCT 7 G7 T 7 AT A GC A CT G T GC T G T G T G CC A A CC A AG C A A AT G 7 T T A C T C ACCT 7 A T G C C 


4310 4330 4230 4340 4350 4360 

A 7 G G C A A G TTT A G A G AGC T A T A AG T AT CT T G GG A AG A A A C A A AC AG A G AG AG T A A A A A A A 
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Figure 7 . Frozen section "of rat ventral prostate stained with antibodies 
(AR-52-3-P) to the AR: peptide NH2-Asp-His-Va!-Leu-Pro-Ile-Asp-Tyr-Tyr- 
Phe-Pro-Pro-Gln-Lys-Thr in a dilution of 1 to 3000 using the avidin-biotin 
peroxidase technique. Androgen receptor is indicated by brown staining of nuclei 
in epithelial cells. Immuno-staining was performed as previously described (60). 
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